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Abstract. Wc consider the spread of epidemics in technological and social networks. How do 
people react? Does awareness and cautious behavior help? We analyze these questions and present 
a dynamic model to describe the movement of individuals and/or their mobile devices in a certain 
(idealistic) urban environment. Furthermore, our model incorporates the fact that different locations 
can accommodate a different number of people (possibly with their mobile devices), who may pass 
the infection to each other. We obtain two main results. First, we prove that w.r.t. our model at 
least a small part of the system will remain uninfected even if no countermeasures are taken. The 
second result shows that with certain counteractions in use, which only influence the individuals' 
behavior, a prevalent epidemic can be avoided. The results explain possible courses of a disease, 
and point out why cost-cfBcicnt countermeasures may reduce the number of total infections from a 
high percentage of the population to a negligible fraction. 
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1 Introduction 

How can I protect myself if an epidemic outbreak occurs? This question concerns us all. Individuals 
protect themselves by avoiding contacts to infected people, companies exploit all possibilities to keep 
their employees viable, and governments try to ensure the public health and safety. However, in all these 
cases one must take into account that we live in a mobile society, and prohibiting personal contacts 
between individuals - which define a so called social interaction network - is not desirable. 

Since the beginning of time, pcioplc inhabited different areas and moved with their tribes when needed. 
Although this behavior changed over time, traveling became more and more popular for various reasons. 
Nowadays, even the mobility within a single city is extremely high [14]. Our society relies on delivery 
systems, personal services of different kind, and close co-operation. However, mechanisms that are able 
to control the spread of epidemics (even just within the urban area of a large city) are quite expensive. 
Protection systems and an enormous amount of mostly expensive antidotes are needed, which may not 
be available when a yet unknown disease appears. Moreover, the immunization of the majority of the 
population is most likely not feasible. 

The main question is, how can the society react if an epidemic outbreak occurs? In a developed 
country there is a certain budget for health care. The amount provided by this budget is surely finite, 
but one may assume that it ensures the medication of at least a small fraction of the population. Hence, 
one can ask whether we can embank an epidemic by using only our limited resources. 

Such a resource may be the use of modern media to change the individuals' behavior. One can assume 
that in a modern civilization, individuals are well connected to various types of information sources such 
as newspaper, television, and the Internet. Since these powerful and effective tools are already established 
and often used, a government can easily warn the population. From this time on (for the duration of the 
epidemic) the majority of the informed people will act more carefully. 

On the technological site, we also have to deal with the problem of epidemics. Nowadays, plenty of dif- 
ferent devices provided with microprocessors are guiding us through the day. For many of us these became 
a permanent assistant like smart phones and notebooks. Different devices with similar communication 
interfaces are able to form a so called ad-hoc network. In this context Bluetooth plays an important role, 
and we refer to the network defined by interactions between various devices equipped with this technology 
as a technological network. Although Bluetooth is quite useful, it also became more and more attractive 
to people willing to exploit its weaknesses. Many different attacks were developed and several weaknesses 
have been exploited since the appearance of this technology [25, 11]. One of the main problems consists 
for example in the possibility to crack the PIN needed to establish a connection between two devices 
[33]. Currently, it is even possible to infect e.g. a smart phone even without the active assistance of the 
owner by using malware like worms [11]. Such a compromised device can be used to automatically infect 
other Bluetooth devices in its current vicinity (and in the ad-hoc network to which it belongs). Even 
though in new Bluetooth versions several security issues have been fixed, many devices which are in use 
remain vulnerable. However, the entire replacement of those may take years and result in huge expenses 
for consumers. 

Motivated by the fact that mobile devices always follow the movement pattern of their owners, we 
present a model, which describes the dynamic behavior of individuals in a certain urban environment. 
Furthermore, we utilize proper parameters to cover the main characteristics described above [12]. In the 
rest of this paper, the terminology follows the usual notions known from the field of epidemic diseases, and 
we do not distinguish between the spread of epidemics in social interaction and technological networks. 
We consider the following models (cf. Section 2 for a more formal description) . 

(A) Emerging country model: In this model we assume that the medical equipment is rather limited. 
Thus, an epidemic may survive for a long time. Furthermore, we assiime that the warning possibilities 
are also very limited. Media like television or the Internet does not exist or provides rather small benefit. 
In other words, this type of model provides only negligible possibilities for disease control. The most 
interesting question is whether it is possible for the population to survive despite these circumstances. 

(B) Industrialized country model: Compared to the emerging country model, we now assume a signif- 
icantly larger budget for disease control. This assumption implies a few facts. First, the government is 
able to cure a large number of individuals simultaneously. Furthermore, the health care system is capable 
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of taking people into quarantine to protect the rest of the population, and the media is also omnipresent. 
Thus, a message communicated via newspaper, television, or the Internet will be received by the whole 
population within a short time. The main question is whether these possibilities provide a positive impact 
on the embankment of an epidemic. 

1.1 Related Work 

One of the most important processes analyzed on social networks (in the usual sense) is the spread 

of diseases. There is plenty of work considering epidemiological processes in different scenarios and on 
various networks. In this subsection, we only describe the papers which are closely related to our results. 

The simplest model of mathematical disease spreading is the so called SIR model (see e.g. [20, 30]). 
The population is divided into three categories: susceptible (S), i.e., all individuals which do not have 
the disease yet but can become infected, infective (I), i.e., the individuals which have the disease and 
can infect others, and recovered (R), i.e., all individuals which recovered and have permanent immunity 
(or have been removed from the system) . Most papers model the spread of epidemics using a differential 
equation based on the assumption that any susceptible individual has uniform probability /3 to become 
infected from any infective individual. Furthermore, any infected player recovers at some stochastically 
constant rate 7. 

This traditional (fully mixed) model can easily be generalized to a network. It has been observed 
that such a case can be modeled by bond percolation on the underlying graph [18, 29]. Callaway et 
al. [7] considered this model on graph classes constructed by the so called configuration model (i.e., a 
random graph with a given degree distribution). The SIR model has also been analyzed in some other 
scenarios, including various kinds of correlations between the rates of infection or the infectivity times, 
in networks with a more complex structure containing different types of vertices [29], or in graphs with 
correlations between the degrees of the vertices [27]. Interestingly, for certain graphs with a power law 
degree distribution, there is no constant threshold for the epidemic outbreak as long as the power law 
exponent is less than 3 [30] (which is the case in most real world networks, e.g. [13, 1, 3, 32]). If the 
network is embedded into a low dimensional space, or it has high transitivity, then there might exist a 
non-zero threshold for certain types of correlations between vertices. However, none of the papers above 
considered the dynamic movement of individuals, which seems to be the main source of the spread of 
diseases in urban areas [12]. 

Borgs ct al. [6] focused on how to distribute antidote to control epidemics. The authors analyzed 
a variant of the contact process in the susceptible-infected-susceptible (SIS) model on a finite graph in 
which the cure rate is allowed to vary from one vertex to the next. That means the rate at which 
an infected node v becomes healthy is proportional to the amount of antidote it received, given a fixed 
amount of antidote R = J2xev whole network. The authors studied contact tracing on the 

star graph and the distribution of the antidote proportional to the node degree on expander graphs and 
general graphs with bounded average degree as curing mechanisms. They state that using contact tracing 
on a star graph would require a total amount of antidote which is super-linear in the number of vertices. 
Here, contact tracing on a graph means that the cure rate is adjusted to py = p-\- p'd*,{t) at every time t, 
where d*{t) denotes the number of infected neighbors of v at time t and p > is some c;onstant. Prom the 
point where the number of infected leaves is oj{p/(3), the epidemic can not be prevented anymore with 
high probability, even if the amount of antidote is /3n^/^~°(^\ where (3 describes the probability of a node 
to become infected by a neighbor. However, setting proportional to the degree requires an amount 
of antidote that scales only linearly with the number of vertices, even on general graphs, provided the 
average degree is bounded. On the other side, even if the underlying graph is an expander, then curing 
proportional to the degree cannot reduce the needed amount of antidote by more than a constant factor. 

In [12], Eubank et al. modeled physical contact patterns, which result from movement of individuals 
between specific locations, by a dynamic bipartite graph. The graph is partitioned into two parts. The 
first part contains the people who carry out their daily activities moving between different locations. 
The other part represents the various locations in a certain city. There is an edge between two nodes, 
if the corresponding individual visits a certain location at a given time. Certainly, the graph changes 
dynamically at every time step. 
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Eubank et al. [12, 9] analyzed the corresponding network for Portland, Oregon. According to their 
study, the degrees of the nodes describing different locations follow a power law distribution with exponent 
around 2.8^. For many epidemics, transmission occurs between individuals being simultaneously at the 
same place, and then people's movement is mainly responsible for the spread of the disease. 

The authors of [12] also considered different countermcasures in order to avoid an epidemic outbreak. 
They stated that early detection combined with targeted vaccination are effective ways of defense com- 
pared to mass vaccination of a population. However, in most cases it is not possible to find the individuals 
having many acquaintances, and in many cases vaccinations cannot even be applied (e.g., SARS, or swine 
flu in its early stages). 

In addition to the theoretical papers described above, plenty of simulation work has been done. Two 
of the most popular approaches are the so called agent-based and structured meta-population-based, 
respectively (cf. [2, 21]). Both models have their advantages and weaknesses. The main idea of the meta- 
population approach is to model whole regions, e.g. georeferenced census areas around airport hubs [4], 
and connect them by a mobility network. Then, within these regions the spread of epidemics is analyzed 
by using the well known mean field theory. In contrast, the agent-based approach models individuals 
with agents in order to simulate their behavior. In this context, the agents may be defined very precisely, 
including e.g. race, gender, educational level, nutritional status, age, priority groups, participant class, etc. 
[22, 23], and thus provide a huge amount of detailed data conditioned on the agents setting. Furthermore, 
these kind of models are able to integrate different locations like schools, theaters and so on. Thus, an 
agent may or may not be infected depending on his own choices and the ones made by agents in his 
vicinity. The main issue about the agent-based approach is the huge amount of computational capacity 
needed to simulate huge cities, continents or even the world itself [2]. This limitation can be attenuated by 
reducing the number of agents, which then entails a decreasing accuracy of the simulation. In the meta- 
population approach the simulation costs are lower, sacrificing accuracy and some kind of noncoUectable 
data. 

To combine the advantages of both systems, hybrid environments were implemented (e.g. [5]). The 
main idea of such systems is to use an agent-based approach at the beginning of the simulation up to some 
point where a sufficient number of agents are infected. Then, the system switches to a meta-population- 
based approach. Certainly, such a system combines the high accuracy of the agent-based simulations at 
the beginning of the procedure with the faster simulation speed of the meta-population-based approach 
at stages, in which both systems seem to provide similar predictions. 

Nonetheless, these kind of simulations confirm the positive impact of non-pharmaceutical countermea- 
sures, which is underpinned by examinations on real data (e.g. [26]). Germann et al. [17] investigated the 
spread of a pandemic strain of influenza virus through the U.S. population. They used publicly available 
2000 U.S. Census data to identify seven so-called mixing groups, in which each individual may interact 
with any other member. Each class of mixing group is characterized by its own set of age-dependent 
probabilities for person-to-person transmission of the disease. They considered different combinations of 
socially targeted antiviral prophylaxis, dynamic mass vaccination, closure of schools and social distancing 
as countermcasures in use, and simulated them with different basic reproductive numbers Rq. It turned 
out that specific combinations of the countermeasures have a different influence on the spreading process. 
For example, with _Ro — 1-6 social distancing and travel restrictions did not really seem to help, while 
vaccination limited the number of new symptomatic cases per 10,000 persons from ^ 100 to ^ 1. With 
Rq = 2.1, such a signiflcant impact could only be achieved with the combination of vaccination, school 
closure, social distancing and travel restrictions. In [24] Liu et al. examined the influence of two parame- 
ters, the decay rate of the disease and the range a person can be infected in, on the spread of an epidemic 
in the urban environment of the Haizhu district of Guangzhou. The results imply the importance of both 
parameters. Especially the results of the distance parameter, which is influenced by peoples behavior, 
imply significant impact on the disease spreading if manipulated wisely. 

Concerning mobility in a 2D field, Valler et al. [34] analyzed the epidemic threshold for a mobile 
ad-hoc network. They showed that if the connections between devices is given by a sequence of matrices 
Ai,...At, then for As < 1 no epidemic outbreak occurs, with high probability, where Xs is the first 



^ In [12] the degree represents the number of individuals visiting these places over a time period of 24 hours. 
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eigenvalue of n'^^{l — 5)1 + 13 Ai with /? and S being the virus transmission probabiUty and the virus 
death probability, respectively. They also approximated the epidemic threshold for different mobility 
models in a predefined 2D area, such as random walk, Levy flight, and random waypoint. 

However, realistic scenarios do not only consider the spreading process itself. In reality, we are influ- 
enced by many factors, e.g. the awareness about an epidemic. In [15], Funk et al. analyzed the spread 
of awareness on epidemic outbreaks. That is, the information about a disease is also spread in the net- 
work, and it has its own dynamic. In [15] the authors described the two spreading scenarios (awareness 
vs. disease) by the following model. Each individual has a level of awareness, which depends on the num- 
ber of hops the information has passed before arriving to this individual. This was combined with the 
traditional SIR model. It has been shown that in a well mixed population, the spread of awareness can 
result in a slower size of outbreak, however, it will not affect the epidemic threshold. Nevertheless, if the 
spread of information about a disease is considered as a local effect in the proximity of an outbreak, then 
awareness can completely stop the epidemic. The impact of spreading awareness is even amplifled if the 
social network of infections and informations overlap. 



1.2 Our Results 

In our dynamic model we integrate the results of [12], and assume that every individual chooses a 
location independently and uniformly at random according to the power law degree distribution of the 
corresponding places. This is the flrst analytical result on the spread of epidemics in a dynamic scenario, 
where the impact of the power law distribution describing the attractiveness of different locations in an 
urban area is considered. 

First we show that in the emerging country model it is very unlikely for a (deadly) epidemic to wipe 

out the whole population. This holds due to the decreasing number of survivors and infected people over 
time. That is, the (infected and uninfected) population size is decreasing over time while the available 
space for each individual is not. This implies a decreasing probability for two individuals to meet, since 
the available space is large enough for the healthy individuals to avoid the infected ones (cf. Section 3.1). 
This provides analytical evidence in our model for a conjecture expressed in a historical documentation 
about the plague in the mid ages [16]. 

Second we show that in the industrialized country m,odel the; use of news services combined with an 
appropriate health care budget limits the number of infected persons to a negligible fraction. Due to the 
warnings, the population will act more carefully and the corresponding power law exponent increases. 
That is, a noticeable number of people avoid locations with plenty of individuals. Simultaneously, the 
number of accommodated persons in these locations decreases, as well as the probability for the remaining 
visitors to become infected. 

The model of this paper seems to be completely different from most of the models considered in 
Subsection 1.1. In many of these papers, the authors used the well known mean field theory to model 
the spread of epidemics in urban areas (e.g. census areas around airports [4]) or within different mixing 
groups [17]. Then, infected individuals may pass the disease to every other member of their community, 
with a certain probability. In our model, the infection is only transmitted between individuals being in the 
same cell at a certain time step, which implies completely different results depending on the distribution 
of individuals among the cells. Concerning the geographic mobility models (e.g. [34]), there are several 
interesting results w.r.t. different mobility patterns for the individuals, such as Levy fiight or random 
waypoint. In all these cases, free movement in a 2D simulation field is considered, where the spacial 
distribution of the random walk or Levy flight mobility models are uniform, and the distribution of nodes 
in the neighborhood of an infected device is Gaussian. In our paper, we try to integrate the distribution of 
the attractiveness of different locations in an urban area, which seems to follow a power law distribution 
[12]. 

The main disadvantage of our model is that we do not take the personal preferences of different 
people into account, and we ignore any dependency between certain choices (e.g., a married couple is 
in many cases together at the same place). Nevertheless, since most of the above dependencies in real 
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world are positively correlated , the results should even be stronger in more realistic scenarios. This 
also seem to hold for periodic mobility [34]. It would also be interesting to integrate the Levy flight 
model to some extent, although this in general does not hold in urban environments [14]. Nevertheless, 
even the simple mobility model analyzed in this paper provides evidence for the positive impact of non- 
pharmaceutical interventions (e.g. school closures) and cautious behavior (public gathering bans as well 
as public warnings), which has already been observed in real world studies [26]. 

2 Model and annotation 

In this section we present the formal model used in this paper. 

Modeling the environment To model the environment we use a static grid structure of size ^/ku x ^/kti, 
where k is a constant. That is, our grid contains nn so called cells. The cells represent physical locations an 
individual can visit, e.g., a restaurant, the office, or a concert hall. Each cell may contain nodes (also called 
individuals), depending on its so called attractiveness. In reality there are different places with varying 
attractiveness in an urban area [12]. The attractiveness d of a cell v is chosen randomly with probability 
proportional to l/cZ", where a is a constant larger than 2 (according to [12], a « 2.8 for locations 
in Philadelphia). We bound the highest attractiveness to y/nn. Since the objects move randomly, the 
expected number of nodes contained in a specific cell with attractiveness d is given by n- d/ c^i, 

where c is some normalizing constant such that X^i^l" '^fs^ = This scenario describes e.g. a city with 
different locations, and individuals visiting these locations according to their attractiveness. 

Modeling moving individuals In the grid structure described above there are n nodes (or individuals), 
which move from one cell to another in each step. Each node chooses the target location in some step 
independently and with probability proportional to its attractiveness. Now, assume that an infection 
starts to spread among the nodes. To model the spreading process, we use three different states, which 
partition the set of nodes in three groups; X{j) contains the infected nodes in step j, U{j) contains the 
uninfected (susceptible) nodes in step j (i.e. subject to infection but not infected already) and TZ{j) 
contains the resistant nodes in step j (i.e., nodes which became cured and can not be infected anymore). 
Whenever it is clear from the context, we simply write I, U, and TZ, respectively. If at some step j, an 
uninfected node i visits a cell which also contains a node of I(j), then i becomes infected and carries the 
disease further^. 

Possible variations In our model, there are only three variables which can vary. One is a, which represents 
the probability distribution of the attractiveness of different locations. Another is k, which describes the 
total number of cells, i.e., the locations the nodes may visit. In addition, the time until an infected 
individual is cured again can also vary. That is, a time period r is assigned to the epidemic, which means 
that an individual is infective for t consecutive steps. With r very large (i.e., w(logn)) we obtain the 
model without any recovery. 

Parameters In the cinicirging country model we assume 2 < a < 3, k = 1 and t = o(logn). In contrast, 
due to the countermeasures applied in the industrialized country model, we assume that a, k, t are large 
constants there. In both models an uninfected node becomes infected with probability 1, as soon as an 
infected node is accommodated in the same cell. 



* That is, persons who meet at some place will more likely meet at (other) places too. On the other side, it is 
not very likely that persons who have not met within a certain time frame, will meet each other afterwards. 

^ This model can easily be extended to the case, in which the disease is transmitted according to a given 
probability. 
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3 Analysis 

We start our analysis with some basic observations. 

Observation 1 The expected number of nodes choosing a specific cell with attractiveness d is proportional 
to d. 

Observation 2 The probability for a node to choose an arbitrary cell with attractiveness d is proportional 
to d-°'+K 

As we can see, the number of cells with a high attractiveness decreases with an increasing a. On the 

other side, while k increases, the area, in which infected nodes may infect other nodes, decreases. Then, 
the probability for two nodes to choose the same cell decreases too. 

3.1 Emerging country model 

One major concern about the breakout of a deadly disease is that the epidemic may wipe out the majority 
of the population. At the beginning of the outbreak people become infected. Then, they carry the disease 
and distribute it among the individuals they meet at different locations. Let us assume a non curable 
course of disease, in which the infected individuals decease after some time period. At some time, there 
is only a small fraction of the population which is still alive, and some of them still carry the infection 
further. We model this situation (at some time step j) assuming that \T{j) \ ■ < n"^"^, for some small 

constant e. The most interesting question is whether these nodes manage to infect the remaining healthy 
nodes, and exterminate the whole population. 

Lemma 1. Let ■ |W(j)| < n^^, where e is an arbitrarily small constant, k is a constant and r is 

a slow growing function in n (i.e., r = o{\ogn)). Then in a round there is no newly infected node with 
probability 1 — n~^^^h 

Proof. Let a pairing of two nodes i G i' G U{j) describe the event that i and i' choose the same cell. 
Let Xiy^d be the event that nodes i and i', where i ^ i' , choose the same (specific) cell with attractiveness 
d. Then, the probability Pr{xi^i'^d) is bounded by 

'izi^) -»(!)) ) 4'^^^^^- 

Further, let Xj^j' be the event that node i and i' meet in an arbitrary cell. Then, Pr{xiy) < c'^Pr{xi^i'^4), 
and we obtain 

P..,,,.(i^)|=^f.^*^^ 

Now, a fixed uninfected node i' becomes infected with probability at most \I{j)\Pr{xi^i') and the 
expected number of newly infected nodes is bounded by 

K(a-2) (2^-"f 
H<mj)\\imPr{x.,,) < ^ J±_,/ (1 + 0(1))- 



Since the nodes of U{j) are assigned to the cells independently, we use Chernoff bounds [8] to obtain 
the desired result. With (1 + 5)/i = 1 and X being the random variable describing the number of newly 
infected nodes, we obtain 
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□ 

In the next theorem, we show that at least a polynomial fraction of the population remains uninfected, 
even if no countermeasures are taken. 

Theorem 1. Let k = 1 andletr be a slow growing function in n (i.e., t = o{logn)). Then, a polynomial 
fraction of the population remains uninfected, when the spreading process runs out. 

Proof. We analyze the procedure in three phases. The first phase contains only the increase in the 
number of infected nodes. However, we will ensure that a sufficient number of uninfected nodes will still 

be present, where this number may be very small compared to the size of the population. In the second 
phase we show that either the spreading process runs out at some time when the number of uninfected 
nodes is polynomial in n, or we will have a situation where the assumptions of Lemma 1 are fulfilled. 
Consequently, we apply Lemma 1 in the third phase. Let i denote the current time step, where it holds 
i = at the beginning, and let ti,t2 describe the step after the corresponding phases 1 and 2 have ended. 

Phase 1 For this phase we assume, that no node becomes cured. Thus, all infected nodes remain infected 

during this phase, and carry the disease to all nodes they meet in the cells. According to the power law 
distribution of the attractiveness, a constant fraction of cells will have attractiveness d = 2. Remember 
that K = 1, and hence the number of all cells is n in total^. The number of cells not hosting any infected 
node can be modeled by a simple balls into bins game, and we obtain that the number of such cells is 
0(n) [31]. Then, the probability that an uninfected node remains uninfected in one step is also a constant. 
Since the uninfected nodes are assigned to the cells independently, we apply Chernofl[ bounds to conclude 
that as long as \U{i)\ = n®^^\ at least a constant fraction of U{i) remains uninfected after step i, with 
probability 1 — n~^^^\ 

Phase 2 We now add the curing procedure to our analysis. Thus, [R-{ti + i)\ will increase in each 
step. However, we know from the first phase that \U{i + 1)| = 0{\U{j)\) for any j. Then, the number 
of uninfected nodes will not decrease below some polynomial in n, or there is some step j such that 
U 7^(j)| ~ n — n'^ and \U{j) \ = for some e' > small enough. Assuming that r = o(logn), after 
r steps it holds that n{i + r) = I{i) U n{i) and \U{i + r)| = \U{i)\/n°'-'^\ Thus, 

\I{i + r)\-\U{i + T)\& 

where we assumed that X{i + r) ^ 0. Thus, there is some constant e > small enough, such that 
\I{i + T)\-\U{i + T)\=n''\ 

Phase 3 Since at this point |I(i2)| • 1^(^2)1 < ri^^, we apply Lemma 1 for r steps, and obtain the theorem. 

□ 

Theorem 1 implies that if r = o(logn), the disease will run out, and a polynomial fraction of the 
population survives. This is surprising, given the fact that an aggressive virus with a long time frame for 
transmitting the infection (i.e., r unbounded) is spread among the individuals of a population. Theorem 
1 seems to explain the behavior of certain epidemics known from the history. We know that some of these 
epidemics exterminated a large fraction of the population of various cities. However, a fraction of the 
citizens could always survive (e.g. the plague [16]). 




The result can easily be extended to arbitrary constant k. 
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3.2 Industrialized country model 

In reality the most effective countermeasures against the spread of diseases (except vaccination) seem 
to be warnings and isolation [26]. When a new disease breaks ont, tlic media warns the pubhc and 
informs it about the risks. The most important facts are mentioned, including the ways of transmission 
and possible precautions. A few examples are the bird flu, swine flu, and SARS. In reality it is often not 
possible to isolate infected people at the time they become infected. Thus, such an infected individual 
may potentially infect others until his symptoms show up and he becomes isolated. We assume that such 
a disease has an incubation time not bigger than a constant s, and model this fact by setting t = s. Now 
it remains to include warnings spread through the media into our model. A warning basically affects the 
constants a and k, since the individuals will most likely avoid places with a large number of persons, 
waive needless tours, and be more careful when meeting other people. Although these modifications alone 
are most likely not sufficient, we show that a combination of these strategies, which are able to sufficiently 
influence the constants a and k, are enough to stop the spread of the disease in our model. Therefore, 
we assume in our analysis that a and k are large constants. 

Lemma 2. Let the set Gk contain all cells with attractiveness 2^ up to 2^'^^ — 1 and let |X(j)| = /'(n) 
for a specific step j, where f{n) is some function with lim„^oo f{n) = oo and q > 3 constant. If k < 

(^J_2) ■ log ^^^671 the number of newly infected nodes in Gk is bounded by O ^ (2'°-i^)"-3 ) 

probability 1 - ^nuM) ■ 

Proof. According to our assumption, the expected number of infected nodes at the beginning of step j, 
which choose cells in group Gk with k < ^^^^2) ' ( Wnl) ' ^ 

9fc+l_-, 

V \m\d 

To prove the lemma, we formulate the problem as a vertex exposure martingale [28]. Let Za^b be the event 
that node a and b choose the same cell. We define a graph G = {V, E) by setting V = U L, where 
L = {!,..., n- and T(j) n L = 0. The set of edges is E = {{x^,l) \ € T(j) A I € Lhz^.^i}. Let 

now be the vertex exposure sequence given by xi, ...,xx(j) € ^ij)- Thus, each Xi represents an infected 
node which may establish edges connecting Xi to the set L. By standard Chernoff bounds [8, 19] it follows 
that if a is large enough, i.e., 2*= 2V(°-2)-i°g(^«V/3(")) ^ o(/(n)), then an infected node has 0{f{n)) 
edges with probability 1 — i/e^'J''")' By using the union bound, we may conclude that with probability 
1 — ^n(fM) ^■ll infected nodes placed in cells of the group Gk have 0{f{n)) edges. 

Now we restrict the probability space to the events, in which O (^ (2''-r)°-^ ) infected nodes choose cells 

in Gfc, and each of these nodes has 0{f{n)) edges. We obtain such an event with probability 1 — e~^''^^'^h 
Let Xq,Xi, ... be the vertex exposure martingale, where Xj is the expectation on the number of edges 
incident to V mG^(j), conditioned by the knowledge of the edges incident to Xi, . . . ,Xi, where lokU) is 
the set of infected nodes choosing cells in Gk- Note that we only expose the infected vertices lying in Gk- 
Given the assumption above, for any k < (^^35) ' ( (n j ) ^® have \Xk — E[Xk]\ < 0{f{n)). Then, 

^Kl^|x.,0)|l - Xol > A) < 2 . e-^I^I^TH^ < exp (-^^^g^y), 

with A = K,(2^-i)°'-'^ — ^ /t"^ ■ Si'^ce the expected number of uninfected nodes in a cell of Gk is less than 
2*^+1 - 1, we obtain Xq = O ( (aJ^f^) • (2*^+^ - 1) = O ( (^J^ifi-a ) , and the lemma follows. □ 

Theorem 2. Let an epidemic disease be spread in a y/nn x -i/ivn network as described in section 2- 
Furthermore, let t = s, where k, a, and s are some large constants. Then, the network is healthy again 
after 0((loglogn)^) steps, with probability 1 — o(l). Moreover, when 1=0, the set TZ has size log*^^^^ n. 
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Proof. The proof consists of three parts. In the first part we show that the number of infected nodes 

decreases after s steps by at least a constant factor, with probability 1 — 2s|X(j)|~^/'?, where g is a 
constant. The second part states a result about the oscillating behavior (w.r.t. the number of infected 
nodes) during the whole process. Finally the third part shows, that if the number of informed nodes is 
just small enough (although dependent on n), it is sufBcicnt to consider ©((log log n)^) additional steps to 
eliminate the remaining infected nodes, with probability 1 — log~^'^^''(n). Then, all parts together imply 
the validity of the theorem. 

Part 1 Let u he a proper upper bound on the number of newly infected nodes in the network. This 
bound holds with probability 1 — 0(|I(j) |~^/^) at step j and will be computed later. By setting r = s the 
infected nodes will carry the infection further only for a constant time s. Let a super-step be a sequence 
of s consecutive steps. We know that after a super-step the nodes which were infected at the beginning 
of this super-step, become cured (i.e., they are moved to the set TV). Let i > s be an arbitrary time step. 
Then, 

< - B)|(l + Vf - - 5)1 < |I(i - S)| ((1 + Vf - 1) . (1) 

To obtain a value for u we ask how many different cells become occupied by the infected nodes at 
most. Therefore, we group the cells with respect to their attractiveness. The group Gk contains all cells 
with attractiveness 2*^ up to 2*^+^ — 1. Let Xij describe the number of nodes, which are infected by i e X(j) 

in step j, and define Xj J2viei(j) ^^^^ assume that = log^ n. Note that if a is large 

enough (cf. Lemma 2), the expected number of infected nodes contained in all the cells of attractiveness 

1 lor;( 1^0)1 ") o 

at least 2°"^ Viog-^ "7 is bounded by ©(log n), with probability 1 — p where p is some proper 

constant. If the attractiveness is larger than I = '"^('■^'^•'^l ^ '"), then the expected number of 
infected nodes in all these cells together is o(log~^ n). 

Let n be the expected number of infected nodes contained in Gk- Note that E^^n decreases with 
the number of infected nodes. According to Lemma 2, if f(n) = logn, then the number of newly infected 

nodes in Gk in some step j is O (^j2^zrijl^^ j with probability 1 — n~^^^\ Thus, we obtain 

^((^^3ljL)+^(Vn)-(2.|-l) 

< c' . (^1 + ^^^p^^ = c' . (1 + 0(1)) , (2) 

with probability 1 — o(log~''' n). Note that considering the attractiveness up to I only is crucial here, since 
no infected node is in some cell with attractiveness larger than I with probability 1 — o(log~^ n). In this 
context c. is a small constant depending on a and k. For decreasing |X(j)| we obtain the general formulas 
where log(n) is replaced by f{n) in the statements above. Note that at this point one can represent |X(j)| 
as f'^{n). Then inequality (2) holds with probability 1 — Thus, u = d with the probability 

given above. 

Then, it holds that 

((1 + .)^ - 1) = t ( Q 1- V'^) -l = ± Q 1-'=.'= < ^^^^IpP < .se^ « 1, (3) 
if c's « 1/e. Here, the first inequality in (3) holds using the following estimation from [19] 

Then (1), (2) and (3) imply a decreasing number of infected nodes after each super-step by a factor of 
vse, with probability at least 1 — 2s|I(j)|~^/'. 
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Part 2 The results so far imply an oscillating behavior of the number of infected nodes. That is, the set I 
will mainly decrease (cf. Part 1). However, the number of infected nodes may also increase in some time 
steps. If one infected node chooses a destination with plenty of nodes from U, then I increases drastically. 
Thus, it remains to compute the probability for such an event. 

Now we divide the whole process into phases. Lets consider each phase separately, and assume that 
a specific phase begins at time step j. A phase consists usually of s steps, and a step is divided into two 
substeps. In the first substep of a step j', we allow each infected node to transmit the disease to every 
other node being in the same cell. In the second substep, all nodes which were infected in step f — s are 
moved to TZ, and stop transmitting the disease in the subsequent steps. 

Now, a phase starting at some time step j ends after s steps, if in all these steps j' < s it holds that 
< ^^<^\^ij)\- This holds with probability at least 1 - 25[X(j)[^^/* (see above). If in some 
step j' < s + j') \ X{j)\ > vse\X{j)\, then this phase ends, and in the next step we start with a new 
phase. 

Now we model the process as a special random walk (cf. Figure 1). For this, let G' = {V',E') be 
a directed graph, where V = {loglogloglogn, . . . , log'' n}. A node v G V corresponds to the case 
|I| = V. From each v, there is a transition to maxjumin, j/sew} with probability 1 — 2sv~^^'^, where 
Vmin = log log log log n (all the other transitions are not relevant and can be arbitrary). In order to show 
that within 0((loglogn)'^) steps the vertex Vmin is visited (log log n)^ steps, we state the following lemma. 

Lemma 3. Let (Xt)^-^ be a Markov chain with space {!,... ,m} that fulfills the following property: 

— there are constants C\,C2 < 1 such that for any t e IN, PrfX^+i < ciX^ > 1 — X^'^^. 

Let T = min{t e M | = 1}. Then, 

Pr[T = 0{\og^ m)] > 1 - m"^. 

Proof. The proof follows from some of the arguments of Claim 2.9 from [10]. However, there are two 
main differences. First, the probability of not moving toward the target vertex (which is m in their case 
and 1 in our case) is in [10] exponentially small w.r.t. the current state ^Xi. while we only have some 
polynomial probability in X^^. Second, in their case the failure in moving toward the traget vertex most 
likely occurs at the other end of the graph (close to 1), while in our case a failure mainly occurs at some 
state, which is close to the target. 

We define a step to be decreasing, if X^+i < CiX^. Furthermore, a decreasing step is successful, if 
= 1 after some X^ > 1. Let F be a random variable which denotes the number of consecutive 
decreasing steps without reaching a successful step. Note that t = logi/d upper bound on Y since 

X,<c;*^^'-"m=l. 

We divide now the Markovian process Xt into consecutive epochs. Every time we fail to have a 
decreasing step, we start a new epoch. Furthermore, if a step is successful, we stop. Now we show that an 
epoch is successful with some constant probability. Let Pi be the probability that some epoch i contains 
a successful step. Then, 



Pi> \{ (l - (cjm)-"^) > \[ = p- 



logl/ci 

n 

j=0 3=0 

where p is a proper constant such that 1 < p < (1 — c1^)~'^^ . Then for the exponent a we have 



logi //■. rn — 1 logi /^^ m— 1 , 

a- V (c'ru)-"'' - V ^ ' ' - / 1 - - - -0(1) 

f^o ~ H ~ (l-Vc?)m-= " (1/cf - l)m<== " 

Let T' be the first epoch in which we reach a successful step. As in [10], we know that an epoch has 
such a step with constant probability, and hence Pr\T' = O(logm)] > 1 — m~*. Since an epoch can 
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Fig. 1. Model of the random walk for part 2 of Theorem 2, where c = (use) ^ . Thereby the edges were partially 
plotted for the node i. 



only last for ©(log to) steps, we obtain that a successful step is reached within 0{\og^ m) steps, with 
probability 1 — to^^. □ 

According to the lemma above, in the graph G' , node Umin is visited at least once within 0((log log n)^) 
phases, with probability 1 — log~^'^ n. Furthermore, if = log* n for some j, then + s)| = 

^se|I(j)| with probabihty 1 — 2slog^^n. Therefore, using the union bound we conclude that t^min is 
visited (log log n)^ times by the random walk within ©((log log n)"*) phases, without visiting log'' n twice 
in any two consecutive phases, with probability 1 — o(log^'^'^^ n). 

Part 3 In order to conclude, we observe that within one step, \T\ does not increase to some value larger 
than log' n with probability 1 — o{\og^^ n) (cf. inequality (2)). Furthermore, given that \I\ is always 
smaller than log'' n, we know that within ©((loglogn)"*) steps, with probability 1 — o(log^'''^^ n) there 
are ©(log^logn) time steps, in which the number of infected nodes is at most log log log log n. To show 
that the disease becomes eliminated from the system, we consider the probability for these nodes not to 
meet any other node for r consecutive time steps. 

To show our claim, we only consider time steps in which the number of infected nodes is at most 
log log log log 71. First, we assign the uninfected nodes to the cells. As in Lemma 1, the probability for a 
node to choose a specific cell with constant attractiveness is proportional to Thus, at least a constant 
fraction of the cells will remain empty, since the probability for a cell with constant attractiveness to be 
empty is (1 — c(iyK„)" ~ e~^^^^ Let t = log^ logn be the number of so called phases, where each phase 
consists of T steps. Then, an (infected) node chooses an empty cell for r consecutive steps with probability 
e-'S'('^) and all nodes of I choose empty cells in r consecutive steps with probability e^^^'^'-^^-'^l^. Then, 
in all t phases there is at least one node w e X which does not choose an empty cell in at least one of the 
steps with probability 

(l - ^ -P {-^mkm) ^ n. (4) 

Hence, after ©(log'* logn) steps there is at least one phase, in which all nodes of I spend t consecutive 
steps alone in some cells, with probability 1 — log"^'*-* n. 

Thus, the network becomes completely healthy after ©((log logn)"*) steps with probability 1 — o(l), 
and the theorem follows. □ 



4 Conclusion 

We presented a model, which describes a simple movement behavior of individuals as well as the impact 
of certain countermeasures on the spread of epidemics in an urban environment. Two different parameter 
settings were used for the analysis. In the first case the epidemic can spread nearly unhindered. The 
obtained result for this case implies that w.r.t. our model a part of the population will survive with 
probability 1 — o(l). In the second case the epidemic is combated by public warnings, isolation and 
(limited) medications. One can observe, that in this case the epidemic is embanked after a short time 
with probability 1 — o(l). Furthermore the number of total infections decreases from a high percentage 
of the population to a negligible fraction. 

Nevertheless, several open questions remain. In our model, we assumed that every node chooses a cell 
with attractiveness d with probability proportional do dT"^^^ . However, different individuals may have 
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different preferences which are not included in our analysis. Furthermore, different types of movement 
models arc coiKxnvabk; like Levy flight, periodic mobility model, and grid like movement. Although all 
these characteristics are not considered in this paper, our methods and techniques might be useful to 
analyze more realistic movement models in the future. 
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